Europaisches Patentamt 

® X)))} EUr ° Pean Patem m%CB ©Publication number: 0 076 687 

Office europeen des brevets A1 



© EUROPEAN PATENT APPLICATION 

© Application number: 82305275.8 © int. CI. 3 : 6 10 L 1/00 

© Date of filing : 04.10.82 



© Priority: 05.10.81 US 308273 



@ Date of publication of application : 
13.04.83 Bulletin 83/15 

© Designated Contracting States: 
DE FR GB IT NL SE 



©Applicant: SIGN ATRON, INC. 
12, Hartwell Avenue 
Lexington. MA 02173{US) 

© Inventor: Kates, James M. 
7, Lincoln Street 

Andover Massachusetts 01810(US) 

© Representative: Jackson, David Spence et al, 
REDDIE & GROSE 16, Theobalds Road 
London, WC1X8PUGB) 



© Speech intelligibility enhancement system and method. 

© To enhance the intelligibility of speech, the consonant 
sounds are intensified and, in effect, their intensity equalised 
to that of the vowel sounds in a speech waveform. A 
short-time estimate of the relative spectral shape of an input 
speech signal is determined by envelope detectors (24| 
operating on the outputs of band pass filters (20). Control 
means are provided to respond to such relative spectral 
shape estimate by dynamically controlling a modification of 
the spectral shape of the actual speech signal so as to 
produce a modified output speech signal, the control means 
comprising a combination matrix (28) operating on the 
Y- outputs of the envelope detectors (24) with a matrix of 
coefficients and producing weighted signals (29) as control 
signals. The control signals (29) act on gain selecting logic 
JS (30) to determine the gains of multipliers (31) through which 
$Q respective different portions of the frequency spectrum of 
<0 the input speech are coupled to a summation circuit (32) 
producing the consonant - enhanced output speech signal, 
(D the respective different portions of the frequency spectrum 
|S of the input speech being produced by a bank of filters (20) 
O , supplying the envelope detectors (24) or by a set of different 
_ filters (26). 



INPUT 
SPEECH^ 
SGNAL 







/" 


SPECTWftL 
SHAPE 


ESTIMATION 


CONTROL 


ESTIMATE 


SIGNAL15) 


lOGC 



CONTROL 
SIGNAL tS) 



DYNAMIC 
SPECTRAL 

MODIFtOmON 



ENHANCED 
OUTPUT 
~ SPEECH 
SIGNAL 



FIG. I 



0. 

UJ 



Croydon Printing Company Ltd. 



BNSDOCID; <EP 0076687A1 J_> 



0076687 

-l- 

SPEECH INTELLIGIBILITY ENHANCEMENT SYSTEM AND METHOD 

This invention relates generally to the enhance- 
ment of the intelligibility of speech and more partic- 
ularly to the enhancement of the consonant sounds of 
speech. 

It is desirable in many applications to enchance 
the intelligibility of speech when the speech has been 
processed electronically as, for example, in hearing 
aids, public address systems, radio or telephone 
communications, and the like. Although it is helpful 
to enhance the presentation of both vowel and consonant 
sounds, generally it appears that, since the intelligi- 
bility characteristics of speech depend to such a 
significant extent on consonant sounds, it is primarily 
desirable to enhance the intelligibility of such 
consonants * 

Several approaches have characterised recent 
research into such intelligibility problems, particularly 
with respect to the hearing aid field. One approach has 
been to take the high frequency sounds in speech and 
transpose them to lower frequencies so that they fall 
within the band of normal hearing acuity, leaving the 
low frequency sounds unprocessed. Such approaches are 
discussed, for example, in the article "A Critical 
Review of Work on Speech Analysing Hearing Aids" by 
A. Risberg, IEEE Trans. Audio and Electroacoustics , Vol. 
AU-17. No. 4, December 1969, pp. 290-29 7. The degree 
of success of such an approach appears to be quite 
limited and overall improvement in perceiving consonants, 
for example, was relatively small. 

An alternative approach, akin to the frequency 
lowering technique, has been to slow down the overall 
speech, i.e., to lower the frequencies of the overall 
speech waveform thereby presenting the higher frequency 
content at lower frequencies within the listener's 
normal hearing band. If such e technique is used in 
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real time, segments of the speech have to be removed 
in order to make room for the remaining temporally 
expanded segments and such process can generate 
distortion in the speech. Such techniques are discussed 
in the article "Moderate Frequency Compression for the 
Moderately Hearing Impaired", M. Mazor et al., J. Acoust. 
Soc. Am., Vol. 62, No. 5, November 1977, pp. 1273-1278. 
Although some slight improvement has been observed using 
such frequency compression techniques for up to about 
20% frequency compression, for example, it was also 
noted that a further increase in frequency compression 
only tended to reduce intelligibility. 

A basic problem with both high frequency transpos- 
ition techniques and frequency compression schemes is 
that they tend to distort the temporal-frequency 
patterns of speech. Such distortion interferes with 
the cues needed by the listener to perceive the speech 
features. As a result such approaches tend to meet with 
only limited success in enhancing speech intelligibility. 

Another approach to speech intelligibility 
enhancement is one which preserves the bandwidth of the 
speech and, instead, modifies the level and dynamic 
range of the speech waveform. The goal of such a 
speech processing approach is to make full use of 
the listener's high frequency hearing abilities. The 
hearing abilities of the hearing impaired are described, 
for example, in the article, "Differences in Loudness 
Response of the Normal and Hard of Hearing Ear at 
Intensity Levels Slightly above Threshold", by 
S. Reger, Ann. Otol., Rhinol., and Laryngol-, Vol. 45, 
1936, pp. 1029-1036. In this study of hearing impair- 
ment it was noted that soft sounds could not be 
perceived because of the loss in sensitivity, but that 
more intense sounds were perceived as having near- 
normal loudness. This phenomenon, sometimes referred 
to as "recruitment", has formed a motivation for 
improved hearing aid designs. Thus, an approach that 
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tends to preserve the speech bandwidth and improves 
intelligibility by modifying the speech waveform 
dynamics and spectral energy appears to be a more 
effective approach than frequency transposition or 
5 frequency compression techniques because the features 
of the speech are better preserved. Although such an 
approach has achieved some success, as reported in the 
article "Signal Processing to Improve Speech Intellig- 
ibility for the Hearing Impaired" by E. Villchur, 

10 J. Acoust. Soc. Am., Vol. 53, pp. 1646-1657, June 1973, 
improvement is still needed to provide the most 
effective enhancement of the intelligibility of speech, 
particularly in the enhancement of consonant sounds. 

The system of the invention provides an improved 

15 and effective enhancement of the reproduction of 

consonant sounds by emphasising the spectral content of 
consonants so as to intensify the consonant sound and, 
in effect, to equalise its intensity with that of 
vowel sounds, the latter sounds tending to achieve a 

20 normal intensity much greater than the normal consonant 
intensity. In accordance with the broadest approach 
of the invention, the system thereof processes an input 
speech signal by determining a short-time estimate of 
the spectral shape. The term "spectral shape" as used 

25 herein is intended to mean the spectral content of 
the input speech signal as a function of frequency 
relative to the spectral content at a specified 
frequency/ or a specified frequency region, of the input 
speech signal. The term "spectral content" is intended 

30 to mean, for example, the energy content of the signal 
as a function of frequency, the envelope of the signal 
at a plurality of frequencies or in a plurality of 
frequency bands, the short- time Fourier transform 
coefficients of the signal, and the like. Control means 

35 are provided in response to such relative spectral 

shape estimate for dynamically controlling a modification 
of the spectral shape of the actual speech signal so 
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as to produce an output speech signal. 

Such modification can be achieved, for example, 
by first estimating the short-time spectral shape of 
the overall frequency spectrum of the input speech 
signal. One way of providing such estimate, for 
example, is to determine the spectral contents of 
different selected frequency bands within the overall 
spectrum, (e.g., the energy in each band, the 
envelope in each band, the Fourier transform coefficients 
in each band, or the like) relative to the spectral 
content of one or more reference bands. This determin- 
ation can be achieved by using Fourier transform 
techniques, filtering techniques, and the like. The 
estimated spectral shape of the overall input speech 
signal spectrum, however achieved, is then used to 
control, or modify, the spectral shape of the actual 
input signal, as for example, by modifying the spectral 
content of one or more frequency bands of the input 
signal (which may or may not coincide with the 
previously mentioned selected frequency bands) to 
produce the output speech signal. The term "showtime" 
spectral shape, as used herein, means the spectral 
shape over a selected short time interval of between 
about 1 millisecond to about 30 milliseconds. • 

The invention will now be described in more 
detail, solely by way of example with reference to the 
accompanying drawings wherein 

FIG. 1 is a broad block diagram of a system 
embodying the invention; 

FIG. 2 is a more specific block diagram of a 
system embodying the invention 7 

FIG. 3 is a further, more specific block diagram 
of a system embodying the invention^ 

FIG. 4 is a specific block diagram of an 
alternative embodiment of the invention which is a 
modification of that depicted in FIG. 3; 
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FIG. 5 is a still more specific block diagram of 
a system embodying the invention ) 

FIG. 6 illustrates schematically and more 
specifically a combination matrix circuit of the 
embodiment of FIG. 5 ] 

FIG. 7 is a more specific block diagram of an 
embodiment of the invention ; 

FIG. 8 is a further specific block diagram of 
another alternative embodiment of the invention ) and 

FIG. 9 is a graphical representation of the 
amplitude envelope characteristics as a function of 
time as obtained at the exemplary point in the 
embodiment of the invention depicted in FIG. 8. 

FIG. 1 depicts a broad block diagram of a system 
for processing an input signal in accordance with the 
techniques of the invention. As can be seen therein, 
an input speech signal is supplied to means 10 for 
estimating the spectral shape of the input speech 
signal. Such spectral shape estimation, when 
determined, provides one or more estimation signals 
for supply to a suitable control logic means 11 which 
is responsive to such spectral shape estimate for 
suitably controlling the dynamic modification of the 
spectral shape of the actual input speech signal via 
appropriate spectral shape modification means 12 to 
produce an enhanced output speech signal, as desired. 
The output speech can then be appropriately used where- 
ver desired. For example, the output speech signal may 
be supplied to a suitable transmitter device or a 
system, e.g., a public address system or voice 
communication system, a radio broadcast transmitter, 
etc., or to a suitable receiver device, e.g., a hearing 
aid, a telephone receiver, an earphone, a radio, etc. 

A particular approach in accordance with the 
general approach shown in FIG. 1 is depicted in FIG. 2 
wherein the speech signal is supplied to a bank of 
filters 20, i.e., a plurality of bandpass filters for 
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providing a plurality of frequency bands within the 
overall speech frequency spectrum of the input speech 
signal. An estimate of the spectral content in each 
frequency band relative to the spectral content in one 
or more reference bands is made in spectral shape 
estimation means 21 for supplying a plurality of 
estimation signals to control means 22 which in turn 
supplies one or more control signals for dynamically 
modifying the overall spectral shape of the input 
speech signal. For example, the control signal may 
select one of a plurality of different filters for 
modifying the spectral content of the input speech 
signal, the selection thereof depending on the 
particular estimate that was made. Alternatively, for 
example, a plurality of control signals may be 
generated to control a plurality of separate filters 
each of which corresponds to a selected pass band 
of the frequency spectrum of the input speech signal, 
The pass bands of the filter bank used to modify the 
actual input speech signal may or may not correspond 
to the pass bands of the filter bank so used to form 
the spectral shape estimates. 

FIG. 3 depicts a more specific block diagram 
of the above approach wherein the input speech signal 
is supplied to a selected number N of bandpass filters 
20, designated as BP^ through BP^. The spectral shape 
of the input speech signal is determined by detecting 
the envelope characteristics of the outputs of each 
of the bandpass filters 20 using suitable envelope 
detectors 24. A control logic unit 22 is responsive 
to the outputs of envelope detectors 24 and provides 
a control signal which is used to select one suitable 
enhancement filter from a plurality of M such filters 
25, identified as filters F^ to F^ each having selected 
characteristics for dynamically modifying the shape 
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of the overall spectrum of the input speech signal which 
is supplied thereto. The output from a selected one of 
such enhancement filters 25 thereby provides a desired 
consonant enhanced output speech signal, 

5 Alternatively, FIG. 4 depicts a system similar 

to that of FIG. 3 wherein the selection control logic 
22 provides a plurality of control signals, each supp- 
lied to one of a plurality of N band-pass filters 26, 
indentified as BP 1 ^ to Bp, N i modifying the 

1° spectral characteristics of the input speech signal in 
each pass-band. The modified outputs from each filter 
26 are appropriately summed at a summation circuit 27 
to provide the desired consonant enhanced output 
speech signal. 

3-5 & specific embodiment of the speech enhancement 

of FIG. 3 is depicted in FIG. 5 wherein envelope 
detectors 24 produce a plurality of envelope detector 
signals- . . . X^ which are supplied to combination 
matrix logic 28 to produce weighted signals . . . W N 

20 each of which represents the ratios 29 as depicted. ^. 
One stage of the combination logic matrix 2 8 for 
producing the weight is shown more specifically 
in FIG. 6 wherein a plurality of preselected constant 
coefficients a^ . . . a NN and b^ - . -b^ are used to 

25 multiply the envelope detected signals . . . X^ 

The summation of the multiplier outputs corresponding 
to the "a" coefficients is divided by the summation 
of the multiplier outputs corresponding to the "b" 
coefficients to form the weight , as shown. Similar 

3° matrix steps are used to form weights . . . W^. 
The weights . . . W N are supplied to selection 
circuitry for selecting an appropriate filter 25 in 
accordance therewith. 

In a specific exemplary embodiment of the 

35 invention depicted in FIGS. 3 and 5, three band-pass 

filters 20 were chosen so tHat BP^ covered 2-4 KHz, BP 2 
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10 



15 



covered 1-2 kHz, and BP 3 covered 0.5-1 kHz. The 
combination matrix 2 8 was chosen to give weights 
W 1 =X 1 /X 3 , W 2 =X 2 /X 3 , and W 3 =l. In such case, for 
example, the weights are determined by a comparison 
of the relative energies among the bands, e.g., the 
envelope detected signal from one of the filters 
(e.g., X 3 ) is used as a reference and the energies 
in the other bands (e.g., X^ and X 2 ) are, in effect, 
compared with such reference to provide the desired 
weights. For example, when the energy in a particular 
band (X-^ is large compared with that in the reference 
band (X 3 > , the weight W ]L is greater than unity, when 
the energies are equal the weight is unity,, and when 
the energy is less than the reference band energy 
the weight is less than unity. For the specific 
weights discussed in the above example the coefficient 
matrices are as follows:- 
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The enchancement filter selection circuit at the 
25 output was chosen to contain three filters, one being 
a high-pass filter emphasising the region above 2.5 
kHz, one being a band-pass filter emphasising the 
region from 1 kHz to 2.5 kHz, and the third being an 
all-pass filter having unity gain at all frequencies. 
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The weights were then used by the selection circuit to 
form a composite filter which had a gain of 1 below 
0.5 kHz and which gave a 3:1 dynamic range expansion 
when the associated weight for a given frequency band 
5 was above a pre-selected threshold. This composite 
filter was updated every millisecond to give the 
dynamic spectral shape modification desired. In a 
similar manner, FIG. 7 shows a more specific embodiment 
of. the approach depicted in FIG. 4 wherein the input 

iO speech signal, as in the embodiment of FIG. 5, is 

supplied to band-pass filters 20 and envelope detectors 
24. Combination matrix logic 28 combines the envelope 
detected outputs X^, X2 ... X^, in a selected manner, 
as discussed above, to produce a plurality of weighting 

15 signals . . . in the same general manner as 

discussed above with respect to FIGS. 5 and 6. In this 
case the weighting factors . . . are used to 
select suitable gain constants G^ . . . G^ at gain 
select logic 30 for multiplying the filtered outputs 

20 of bandpass filters 26, designated as BP^ . . . bp 'n' 
as in FIG. 4, which filters separate the input speech 
signal into selected spectral bands. The filtered 
outputs from bandpass filters 26 are multiplied by the 
corresponding gains G-^ . . • G^ at multipliers 31, 

25 the outputs of which are added at summation circuit 
32 to produce the consonant enhanced output speech 
signal . 

Ihe bandwidths of the input signals to multipliers 
31 need not necessarily coincide with the bandwidths 

30 of the input signals to envelope detectors 24 and in 

the general case shown in FIG. 7 different portions of 
the frequency spectrum may be used for each bank of 
filters 20 and 26. In a simplified version thereof, 
the pass bands may coincide in which case the outputs 

35 of bandpass filters 20 can be supplied directly to 
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multipliers 31 (as well as to envelope detectors 24) 
and the filter bank 26 eliminated. 

In the embodiment of FIG. 7 the coefficients 

a ll ' * * a NN an ^ ^11 " * • ^NN are se l ec "ted empirically 
and the weights are then used to provide gains which 
produce independent dynamic range expansions in the 
selected frequency bands. One effective approach is to 
select the gain by comparing the weight with a 
preselected threshold and to provide for unity gain 
when the weight is below the threshold and to provide 
an increased gain at or above such threshold. The 
increased gain may be selected logarithmically, i.e. r 
in accordance with a selected power of the weight 
involved. For example, for suitable expansion on a 

db (logarithmic) scale the gain can be selected in 

2 

accordance with the second power, i.e., when above 

the selected threshold, although effective expansion may 

also be achieved ranging from the first power (W. ) 

3 

to the third power (W i ) . 

While the pass bands of the filters used in the 
above described embodiments of FIGS. 2-7 may be 
selected to provide pass bands which are clearly 
separated one from another, the degree of separation 
does not appear to significantly affect the consonant 
enhancement, although excessive separation would 
appear to have disadvantages in some application. Furthe 
some degree of overlapping of the pass bands does not 
appear to have an adverse effect on the overall enhanc- 
ement operation. 

In a specific example of the invention depicted 
in FIG. 7, for example, four band pass filters 20 are 
used (filters 26 were eliminated) such that BP^ 
covers 2-5 kHz, BP 2 covers 1-2 kHz, BP 3 covers 0.5-1 
kHz and BP 4 covers 0-0.5 kHz. The coefficients "a" 
and "b ft are selected so as to provide weights 
W^X^Xg, W 2 =X 2 /X 3 , W 3 =l and W 4 =l. In each case the 
envelope detected outputs of each band relative to the 
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envelope detected output of a reference band determines 
the weight, Thus f the weights VI ^, W 2 and W 3 are 
determined by the envelope detected outputs X^, X 2 and 
relative to the envelope detected output while 
is determined by the envelope detected output X 4 
relative to X^, Accordingly, the coefficients are 
selected as follows: 
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gains 


are selected as 


follows 







20 



If W x <2 f G^= 1 
W x >2, G] = W x 2 / 4 

If W 2 <2, G 2 = 1 

W 2- 2 ' G 2 = W 2 2/4 
G 3 =G 4 =1 (always) 



25 



A further improvement can be made in the approach 
of the invention by using the modifications discussed 
with reference to FIGS. 8 and 9 which are designed to 
take into better account the background noise present 
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in the input speech signal. If an estimate of such 
background noise is made and the effects of such noise 
is appropriately removed in the spectral shape estimate 
control operation the consonant enhancement can be 
further improved. 

A technique for such operation is depicted in 
FIG. 8 wherein the outputs' of each of the bandpass 
filters 20 are supplied both to peak detectors 35 and 
to valley detectors 36 . The peak detectors follow the 
peaks of the signal by rising rapidly as the signal 
increases but falling slowly when the signal level 
decreases. The valley detectors follow the minima of 
the signal by falling rapidly as the signal decreases 
but rising slowly when the signal level increases. 
The time constant of the peak detector decay is in 
general much shorter than that of the valley detector 
rise. Thus, the output waveforms from such detectors 
tend to be of the exemplary forms shown in FIG. 9 
wherein the sold line 37 represents an input to the 
detectors 35 and 36 from a bandpass filter 20, the 
dotted line 38 represents the peak detector output 
waveform and the dahsed line 39 represents the valley 
detector output waveform. 

The valley detected output signal tends to 
represent the background noise present in the input 
speech signal and if such signal is subtracted at 
subtractors 40 from the peak detected output (which, 
in effect, represents the desired signal plus 
background noise), the signals X 1 . . provide 
improved spectral shape estimates which can then be 
suitably combined as in the combination matrix means 
28 for providing the weighted signals W x . . . as 
before. 

While the specific implementations discussed 
above are disclosed to show particular embodiments 
of the invention, the invention is not limited thereto. 
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Modifications thereto within the spirit and scope of 
the invention will occur to those in the art. For 
example, instead of using discrete filters, as shown 
by the filter bands discussed above, other techniques 
for determining the spectral content in selected 
frequency bands can be used, such as fast Fourier 
transform (FFT) techniques, chirp-z (CZT) techniques 
and the like. Moreover, the spectral content need 
not be the envelope detected output but can be an 
energising detected output, the Fourier transform 
coefficients in a Fourier transform process, or other 
characteristics representative of the spectral content 
involved. Hence, the invention is not to be construed 
as limited to the particular embodiments described 
except as defined by the appended claims. 
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CLAIMS 

1, A sy stein for processing an input speech 
signal, characterised by 

means (10) responsive to said input speech 
signal for estimating the short-time spectral shape 
of said input speech signal as a function of time? 

control means (11) responsive to said shape 
estimate for providing one or more control signals ) 
and 

means (12) responsive to said one or more control 
signals for dynamically modifying the spectral shape 
of said input speech signal to produce an output speech 
signal. 

2. A system in accordance with claim 1, 
characterised in that the estimating means (10) 
estimates the spectral content in each of a plurality 
of selected frequency bands relative to the spectral 
content in one or more of said frequency bands. 

3. A system in accordance with claims 1 or 2, 
characterised in that said estimating means (10) includes 

means (20) for separating said input speech 
signal into a plurality of selected frequency bands ; 
and 

means (21) responsive to the portions of said 
input speech signal in each of said frequency bands 
for estimating the spectral content in each of said 
frequency bands relative to the spectral content in 
a selected one or more of said frequency bands ; 

said control means (23) being responsive to 
the spectral content estimates in said frequency 
bands for producing said one or more control signals. 

4. A system in accordance with claim 3 f 
characterised in *;hat said separating means is a bank 
of filters (20) . 



_ 15 0076687 

5. A system in accordance with claim 3, 
characterised in that said estimating means (21) 
includes 

a plurality of envelope detection means (24) 
5 for detecting the envelope characteristics of said 
input speech signal in each of said frequency bands; 
and 

said control means (22) is responsive to said 
envelope characteristics for providing said one or 
10 more control signals, 

6. A system in accordance with claim 5, 
characterised in that said control means includes 

means (28,29) responsive to said envelope 
characteristics for providing a plurality of weighting 
15 signals^ and 

means (22) responsive to said weighting signals 
for producing said one or more control signals. 

7. A system in accordance with any preceding 
claimr characterised in that said modifying means includes 

20 a plurality of filter circuits (25) each having 

a different characteristic over the frequency spectrum 

of said input speech signal ] and 

means (22) responsive to said one or more 

control signals for selecting one of said plurality 
25 of filter circuits (25) to modify said input speech 

signal so as to produce said output speech signal. 

8. A system in accordance with any one of claims 
2 to 6, characterised in that said modifying means 
includes 

.30 means (26) responsive to a plurality of control 

signals for modifying the spectral content of the input 
speech signal in each of said selected frequency bands; 
and 
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means (27) for combining the modified input 
speech signal in each of said selected frequency 
bands to produce said output speech signal. 

9. A system in accordance with claim 8, 
characterised in that said modifying means (30,31) 
provides a plurality of selectable gains for 
multiplying the amplitude of the input speech signal 
by a selected gain factor in each of said selected 
frequency bands. 

10. A system in accordance with any one of 
claims 2 to 6 , characterised in that said modifying 
means includes 

a plurality of second filter means (26) for 
separating said input speech signal into a plurality 
of second selected frequency bands \ 

means (30,31) responsive to a plurality of 
control signals for modifying the spectral content 
of the input speech signal in each of said second 
selected frequency bands \ and 

means (32) for combining the modified input 
speech signal in each of said second selected 
frequency bands to produce said output speech signal. 

11. A system in accordance with claim 10, 
characterised in that said modifying means (26,30,31, 
32) provides a plurality of selectable gains for 
multiplying the amplitude of the input speech signal 
by a selected gain factor in each of said second 
selected frequency bands. 

12. A system in accordance with claim 6, 
characterised in that said weighting signal producing 
means includes 

matrix means (2 8) responsive to said envelope 
characteristics for multiplying said envelope 
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characteristics by a plurality of second coefficient 
values) and 

means (29) for combining said multiplied envelope 
characteristics so as to produce said weighting 
signals . 

13. A system in accordance with claim 12, 
characterised in that said combining means (Fig. 6) 
includes 

means for combining envelope characteristics 
multiplied by said first coefficients to produce a 
plurality of first combined signals ) 

means for combining said envelope characteristics 
multiplied by said second coefficients to produce a 
plurality of second combined signals ) 

means for determining a plurality of ratios of 
said plurality of first and second combined signals, 
said ratios representing said weighting signals. 

14. A system in accordance with claim 9, 
characterised in that said gain factors are selected 
so as to provide first selected gains when said 
weighting signals are below selected levels and second 
selected gains when said weighting signals are at or 
above said selected levels. 

15. A system in accordance with claim 14, 
characterised in that first selected gains are unity 
below said selected levels. 

16. A system in accordance with claim 15, 
characterised in that said second selected gains 
are proportional to \P , where W is the weighting 
signal for a selected band and N is a selected 
exponent. 
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17. A system in accordance with claim 16, 
characterised in that N is selected as equal to a 
value within a range from about 1 to about 3, 

18. A system in accordance with claim 17, 
characterised in that N is selected as equal to 2. 

19. A system in accordance with claim 5, 
characterised in that said envelope detector means 
(24) detects the peaks of said envelope characteristics 
and the valleys of said envelope characteristics in 
each of said frequency bands. 

20. A system in accordance with claim 19, 
characterised by including means (40) for subtracting 
said valley envelope characteristics from said peak 
envelope characteristics to form combined envelope 
characteristics in each said frequency band and said 
control means in response to said combined envelope 
characteristics. 

21. A method for processing an input speech 
signal, characterised by the steps of 

estimating the spectral shape of said speech 
signal; 

dynamically modifying the spectral shape of 
said input speech signal in accordance with said 
estimate to produce an output speech signal* 

22. A method in accordance with claim 21, 
characterised in that said dynamic modification 
includes the steps of 

producing one or more control signals which are 
functions of said estimate! and 

controlling the dynamic modification of the 
spectral shape of said input speech signal in 
accordance with said control signals. 
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23. A method in accordance with claim 22, 
characterised in that 

said estimating steps include the steps of 
estimating the spectral content of a plurality of 
5 first separate frequency bands of said input speech 
signal relative to the spectral content of one or 
more of said frequency bands. 

24. A method in accordance with claim 23, 
characterised in that said dynamic modification 

10 step includes the step of selecting a filter means 
having a spectral response specified in accordance 
with said estimate. 

25. A method in accordance with claim 23, 
characterised in that dynamic modification step 

15 includes the step of dynamically modifying the 
spectral shape of said input speech signal in a 
plurality of second separate frequency bands in 
accordance with said estimate. 

26. A method in accordance with claim 25, 

20 characterised in that the plurality of first separate 
frequency bands substantially coincides with the 
plurality of second separate frequency bands. 

27. A method in accordance with claim 25, 
characterised in that the first separate frequency 

25 bands are different from the second separate frequency 
bands. 
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